Benchmark thermochemistry of the C n H2 n +2 alkane isomers 
(n=2— 8) and performance of DFT and composite ab initio 
methods for dispersion-driven isomeric equilibria* 

Amir Karton, David Gruzman, and Jan M. L. Martin* 
Department of Organic Chemistry, Weizmann 
Institute of Science, IL-76100 Rehovot, Israel 
(Dated: J. Phys. Chem. A MS jp-2009-04369h: Received May 11, 2009; Accepted May 15, 2009) 



* Dedicated to Prof. Yitzhak Apeloig on the occasion of his 65th birthday 



1 



Abstract 

The thermochemistry of linear and branched alkanes with up to eight carbons has been 
reexamined by means of W4, W3.21ite and Wlh theories. 'Quasi- W4' atomization energies have 
been obtained via isodesmic and hypohomodesmotic reactions. Our best atomization energies at 
K (in kcal/mol) are: 1220.04 re-butane, 1497.01 re-pentane, 1774.15 n-hexane, 2051.17 re-heptane, 
2328.30 re-octane, 1221.73 isobutane, 1498.27 isopentane, 1501.01 neopentane, 1775.22 isohexane, 
1774.61 3-methylpentane, 1775.67 diisopropyl, 1777.27 neohexane, 2052.43 isoheptane, 2054.41 
neoheptane, 2330.67 isooctane, and 2330.81 hexamethylethane. Our best estimates for Ai/j 298K 
are: -30.00 n-butane, -34.84 n-pentane, -39.84 re-hexane, -44.74 re-heptane, -49.71 n-octane, -32.01 
isobutane, -36.49 isopentane, -39.69 neopentane, -41.42 isohexane, -40.72 3-methylpentane, -42.08 
diisopropyl, -43.77 neohexane, -46.43 isoheptane, -48.84 neoheptane, -53.29 isooctane, and -53.68 
hexamethylethane. These are in excellent agreement (typically better than 1 kJ/mol) with the 
experimental heats of formation at 298 K obtained from the CCCBDB and/or NIST Chemistry 
WebBook databases. However, at K a large discrepancy between theory and experiment (1.1 
kcal/mol) is observed for only neopentane. This deviation is mainly due to the erroneous heat 
content function for neopentane used in calculating the K CCCBDB value. The thermochemistry 
of these systems, especially of the larger alkanes, is an extremely difficult test for density functional 
methods. A posteriori corrections for dispersion are essential. Particularly for the atomization 
energies, the B2GP-PLYP and B2K-PLYP double-hybrids, and the PW6B95 hybrid-meta GGA 
clearly outperform other DFT functionals. 



^Electronic address: gershom@wcizmann.ac.il 



I. INTRODUCTION 



The fundamental importance of alkanes as organic chemistry building blocks and in 
industrial chemistry (particularly petrochemistry) is self-evident to any chemist. 

Linear and branched lower alkanes are the principal components of gaseous and liquid 
fossil fuels. Accurate knowledge of their thermodynamic properties is essential for reliable 
computational modeling of combustion processes. (We note that one of us, JMLM, is a 
member of a IUPAC task group working in this area. [l|) 

Aside from their practical relevance, alkanes present some intriguing methodological 
issues. The importance of accurate zero-point vibrational energies and diagonal Born- 
Oppenheimer corrections has been discussed previously^, and this applies to both 
wavefunction ab initio and density functional methods. While post-CCSD(T) computational 



thermochemistry methods like W4 theory 



Q, Q or HEAT0, fl, Q 



have no trouble dealing 



with systems that, from an electronic structure point of view, are much more taxing than 
alkanes, their steep cost scaling makes application to higher alkanes (or higher hydrocarbons 
in general) impractical at present. 



Density functional theory seems to 



number of authors 



flfl 
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3, 



3e the obvious alternative. However, in recent years a 



14 



15| have pointed to a disturbing phenomenon 16| : 



the error in computed atomization energies of n-alkanes grows in direct proportion to 
the chain length. In addition, these same authors found that popular DFT methods 
have significant problems with hydrocarbon isomerization energies in general, and alkane 
isomerization energies in particular. This latter problem appears to be related to the poor 
description of dispersion by most DFT functionals and can be remedied to a large extent by 



empirical dispersion corrections 



111. 



For molecules as chemically systematic as alkanes, a computationally more cost-effective 
approach than brute-force atomization energy calculations is the use of bond separation 
reactions, such as isodesmic 17] and homodesmoticfl8| reactions. Recently, Schleyer and 
coworkers jl^l. 2(| discussed the concepts of 'protobranching' and of 'hypohomodesmotic 
reactions', i.e., reactions which, in addition to being isodesmic (i.e., conserving numbers 
of each formal bond type), conserve the number of C atoms in each hybridization state and 
hapticity (primary, secondary, tertiary, quaternary). The latter is a refinement of the earlier 
'homodesmotic reaction' concept 181 ] . 
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They established a consistent hierarchy of hydrocarbon reaction types that successively 
conserve larger molecular fragments: atomization D isogyric D isodesmic D hypohomod- 
esmotic D homodesmotic D hyperhomodesmotic which provides a converging sequence in 
the sense that the energetic components of the reaction cancels to a larger extent between 
reactants and products as the reaction hierarchy is traversed. 

In the present work we obtain 'quasi- W4' atomization energies for C4-C8 alkanes through 
the use of isodesmic and hypohomodesmotic reaction cycles that involve only methane, 
ethane, and propane in addition to one larger alkane. The reaction energies are calculated 
at the W3.21ite or Wlh levels, while for methane, ethane, and propane W4 benchmark 
values are used. We shall show that the reaction energies of hypohomodesmotic reactions and 
judiciously selected isodesmic reactions are well converged even at the Wlh level. The use of 
hypohomodesmotic reactions leads to near-perfect cancellation of valence correlation effects, 
and the use of judiciously selected isodesmic reactions leads to near-perfect cancellation of 
post-CCSD(T) correlation effects. 

We will then proceed to evaluate a number of DFT functionals and composite ab initio 
thermochemistry methods against the reference values obtained, both for the atomization 
and for the isomerization energies. 



II. COMPUTATIONAL METHODS 



All calculations were carried out on the Linux cluster of the Martin group at Weizmann. 



DFT geometr y op timizations were carried out using Gaussian 03 Revision E.01 
B3LYP 



22 



23 



2l|. The 



■ i24[| DFT hybrid exchange-correlation (XC) functional was used in conjunction 
with the pc-2[25[ polarization consistent basis set of Jensen. All large-scale self-consistent 
field (SCF), CCSD and CCSD(T) calculations 26 .j27 |were carried out with the correlation 



consistent family of Dunning and coworkers 



MOLPRO 



2JH 



30 



31 



321 ] using version 2006.1 of the 



program system. All single-point post-CCSD(T) calculations were carried out 



using an OpenMP-parallel version of M . Kallay's general coupled cluster code MRCC 341 ] 



interfaced to the Austin-Mainz-Budapest version of the ACES II program system [351] . The 
diagonal Born-Oppenheimer correction (DBOC) calculations were carried out using the 
CFOUR program system 36]. 



The computational protocols of Wnh theories Wl [37|, [38[ , W3.21ite[39|], and W4[3[ used in 
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the present study have been specified and rationalized in great detail elsewhere {3, |37|, |38|, [39]. 
(Throughout, W3.21ite refers to variant W3.21ite(c) as described in ref. 39j.) The use of the 
Wnh variants of the Wn methods, in which the diffuse functions are omitted from carbon and 



less electronegative elements, is of no thermochemical consequence for neutral alkanes 
but computer resource requirements are substantially reduced. 

For the sake of making the paper self-contained, we will briefly outline the various steps 
in W3.21ite theory and in W4h theory for first-row elements: 

• reference geometry and ZPVE correction are obtained at the B3LYP/pc-2 level of 
theory for W3.21ite, and at the CCSD(T)/cc-pVQZ level for W4h. 

• the ROHF-SCF contribution is ex trap olated using the Karton-Martin modification 
of Jensen's extrapolation formula 41] : 



4Q| 



E UF ,l = £hf,oo + A(L + 1) exp(-9v / Z) 



(1) 



For W3.21ite and W4h, the extrapolations are done from the cc-pV{Q,5}Z and cc- 
pV{5,6}Z basis set pairs, respectively 

the RCCSD valence correlation energy is extrapolated from these same basis sets. 
Following the suggestion of Klopper|42|, £corr,RccsD is partitioned in singlet-coupled 
pair energies, triplet-coupled pair energies, and T\ terms. The 7\ term (which exhibits 
very weak basis set dependence) is simply set equal to that in the largest basis set, while 
the singlet-coupled and triplet-coupled pair energies are extrapolated using A + B/L a 
with «s=3 and «t=5. 

the (T) valence correlation energy is extrapolated from the cc-pV{T,Q}Z basis set 
pair for W3.21ite, and from cc-pV{Q,5}Z for W4h. For open-shell systems, the 
Werner-Knowles-Hampel (a.k.a., MOLPRO) definition 43j of the restricted open-shell 
CCSD(T) energy is employed throughout, rather than the original Watts-Gauss- 
Bartlett[27| (a.k.a. ACES II) definition. 

the CCSDT— CCSD(T) difference, T3 — (T), in W3.21ite is obtained from the empirical 
expression 2.6xcc-pVTZ(no / ld)(no p on H)-1.6xcc-pVDZ(no p on H), where the 
CCSDT energy is calculated using ACES II. In W4h it is instead extrapolated using 
A + B/L 3 from cc-pV{D,T}Z basis sets. 
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• the difference between ACES II and MOLPRO definitions of the valence RCCSD(T) 
definition is extrapolated from cc-pVDZ and cc-pVTZ basis sets. One-half of this 
contribution is added to the final result, as discussed in the appendix of ref. ^j]. 

• post-CCSDT contributions in W3.21ite are estimated from UCCSDT(Q)/cc-pVDZ(no 
p on H)-UCCSDT/cc-pVDZ(no p on H) scaled by 1.1. In W4h, the connected 
quadruples are obtained as l.lx[UCCSDT(Q)/cc-pVTZ -■ UCCSDT/cc-pVTZ + 
UCCSDTQ/cc-pVDZ - UCCSDT(Q)/cc-pVDZ], while the contribution of connected 
quintuple excitations is evaluated at the CCSDTQ5/cc-pVDZ(no d) level. 

• the inner-shell correlation contribution, in both cases, is extrapolated from 
RCCSD(T)/cc-pwCVTZ and RCCSD(T)/cc-pwCVQZ calculations. 

• the scalar relativistic contribution, again in both cases, is obtained from the 
difference between nonrelativistic RCCSD(T)/cc-pVQZ and second-order Douglas- 
Kroll RCCSD(T)/DK-cc-pVQZ calculations. 

• atomic spin-orbit coupling terms are taken from the experimental fine structure. 

• finally, a diagonal Born-Oppenheimer correction (DBOC) is obtained at the ROHF/cc- 
pVTZ level. 

The main changes in Wlh relative to W3.21ite are: (a) the SCF component is extrapolated 
from the cc-pV{T,Q}Z basis sets, using the formula A + B/L 5 ; (b) The valence RCCSD 
component is extrapolated from the same basis sets, using A + B/L 3 ' 22 ; (c) the valence 
parenthetical triples, (T), component is extrapolated from cc-pV{D,T}Z basis sets, using A+ 
B/L 3 ' 22 ; (d) inner-shell correlation contributions are evaluated at the CCSD(T)/MTsmall 
level; and (e) post-CCSD(T) correlation effects as well as the DBOC are completely 
neglected. 

The CCSDTQ5/cc-pVDZ(no d) calculation for propane proved to be too taxing even for 
our strongest machine (8-core, Intel Cloverton 2.66 GHz, with 32 GB of RAM). For the 
alkanes for which we do have this term, CH4 and C2H6, it is practically zero (0.00 and 0.01 
kcal/mol, respectively), therefore for propane it was safely neglected. 



The anharmonic zero-point vibrational energy (ZPV 



and allene was calculated using the following equation 44 ] 



of propane, propene, propyne, 



zpve _ - ^ uh - - ^ — - - y Ui + ujj + Uk + T2 L + 

6 



where the cubic, quartic, and kinetic energy terms were computed at the MP2/cc-pVTZ level 
of theory, and the harmonic term was partitioned into valence and core- valence contributions 
which were calculated at the CCSD(T)/cc-pVQZ and CCSD(T)/MTsmall levels of theory, 
respectively. (For propane we resorted to a CCSD(T)/cc-pVTZ calculation since the 
CCSD(T)/cc-pVQZ proved too daunting; based on the results for the other systems this 
is expected to have little effect, e.g., the differences between the harmonic ZPVE calculated 
with the two basis sets are: 0.02, 0.02, 0.02, and 0.01 kcal/mol for methane, ethane, propene, 
and allene, respectively.) 

Unless noted otherwise, experimental data for the heats of formation at K were 
taken from the NIST Computational Chemistry Comparison and Benchmark Database 
(CCCBDB) 45|] • The atomization energies quoted in CCCBDB assume CODATA 46J values 
for the atomic heats of formation at K: however, particularly for carbon atom, the 
ATcT value|47j (170.055±0.026 kcal/mol) is significantly higher than the CODATA value 
(169.98±0.11 kcal/mol), consequently, using the ATcT value in converting AHj 0K to 
atomization energy raises the atomization energy over the CCCBDB value by m x 0.075 
kcal/mol for a system with m carbon atoms. Thus, throughout the paper, the experimental 
TAE were obtained from the heats of formation at K using ATcT atomic heats of 
formation at K (C 170.055±0.026, and H 51.633±0.000 kcal/mol) [|7|. In cases where 
only experimental heats of formation at 298 K are available from CCCBDB (n-heptane, 
n-octane, isoheptane, and isooctane), they were first converted to K using ifoaa, — Hq 
for H 2 (g) 2.024±0.000, and C(cr,graphite) 0.251±0.005 kcal/mol from CODATA^, and 
the molecular heat content functions from the TRC (Thermodynamic Research Center) 



tables 



48 



which are the source of virtually all CCCBDB enthalpy functions for the species 
considered in the present paper. 

In order to facilitate direct comparison with experiment, we have also converted our 
calculated atomization energies to heats of formation at room temperature, AHj 2 gsK- 
Rather than mix our calculated atomization energies with the TRC enthalpy functions, 
we have calculated our own i?298 — -^o for the alkanes. The translational, rotational, and 
vibrational contributions were obtained within the RRHO (rigid rotor-harmonic oscillator) 
approximation from the B3LYP/pc-2 calculated geometry and harmonic frequencies. 



Internal rotation corrections were obtained using the Ayala-Schlegel method 



49j , again on the 



B3LYP/pc-2 potential surface. This leaves us with the issue of correcting for the ensemble 
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of low-lying conformers of the alkanes: by way of illustration, n-butane through n-octane 
have 2, 4, 12, 30, and 96 unique conformers, respectively. The relative energies of these 
conformers (which are surprisingly sensitive to the level of theory as they are strongly driven 
by dispersion) were the subject of a recent benchmark study by our group 50J. While large 
basis set CCSD(T) calculations for all conformers of the heptanes and octanes proved too 

n 

costly (primarily for those without any symmetry), it was found in Ref. [50j that the B2K- 
PLYP-D double- hybrid functional 51J with an empirical dispersion correction, in conjunction 
with a sufficiently large basis set, tracks the CCSD(T) reference datajscj] for n-butane, n- 
pentane, and n-hexane exceedingly closely, and this is the approach we have followed for all 
systems with more than one conformer in this work. 

Dispersion corrections for the DFT energies (denoted by the suffix "-D") were applied 



using our implementation of Grimme's expression [52 



53]. 



N a t-l N at 

lj disp — — Sq ^j>2 ^ "^g"/dmp (Rij 
i=l j=i+l U ij 

where the damping function is taken as 



/dmp \Ri. 



1 + exp 



SRRr 



1 



-i 



(3) 



(4) 



and Ci « \IC\Pl Rr 



Rv<w,i + Ry<w,j is the sum of the van der Waals radii of the two 
atoms in question, and the specific numerical values for the atomic Lennard- Jones constants 
Cg and the van der Waals radii have been taken from Ref. 
sr=1.0 and hysteresis exponent a=20.0 as per Ref. 



53) 



521 ]. whereas the length scaling 



Eq- (EI) has a sing 



taken from Refs. 



52 



e functional-dependent parameter, namely the prefactor sq. This was 
53] for BLYP, B3LYP, and PBE, from Ref. 54] for the double hybrids, 
and optimized in the present work for the remaining functionals. These were, for the most 
part, optimized against the S22 benchmark set of weakly interacting systems 55]. 



III. RESULTS AND DISCUSSION 



A. Overview of diagnostics for nondynamical correlation 



The percentages of the nonrelativistic, clamped-nuclei Wlh total atomization energy 
at the bottom of the well (TAE e ) accounted for by SCF, and (T) triples contributions 



S 



are reported in Table SI of the Supporting Information, together with the coupled cluster 



Ti and T>\ diagnostics 



56 



571 ]. and the largest T 2 amplitudes. The percentage of the total 



atomization energy accounted for by parenthetical connected triple excitations, %TAE e [(T)], 
has been shown to be a reliable energy-based diagnostic for the importance of nondynamical 
correlation effects Q. Ref.fl gives useful criteria for assessing the extent of nondynamical 
correlation effects, e.g., %TAE e [SCF]>67% and/or %TAE e [(T)]<2% indicate systems that 
are dominated by dynamical correlation. 

As expected, all the alkanes considered in the present study exhibit very mild 
nondynamical correlation effects, and can be regarded as dominated by dynamical 
correlation. 77-79% of the atomization energy is accounted for at the Hartree-Fock level, 
and 0.7-1.3% by the (T) triples. Table SI (see Supporting Information) shows that the 
%TAE e [(T)] slightly increases with the degree of branching. 

In systems with very mild nondynamical correlation, CCSD(T) is generally very close 
to the full CI limit, as the higher order triple excitations, T3— (T), and the connected 
quadruple excitations, T4, tend to largely cancel one another (they are of similar orders of 
magnitude, but connected quadruples universally increase atomization energy while higher- 
order triples generally decrease it[3j, |4j). For the smaller systems (of up to five carbons), for 
which we were able to explicitly calculate the T 3 — (T) and (Q) contributions, the percentage 
of the atomization energy accounted for by post-CCSD(T) excitations, %TAE[post-(T)], 
varies between -0.01 and -0.02%. Small as these numbers may seem in relative terms, 
the atomization energy for an alkane with more than three carbons already exceeds 1,200 
kcal/mol, 0.02% of which amounts to 1 kJ/mol. 



B. Linear alkanes 



The gas phase heat of formation (viz. the atomization energy) of an arbitrary linear 
alkane can be obtained from the following hypohomodesmotic reaction involving ethane and 
propane. 

C m H 2m+2 + (m - 3)C 2 H 6 -> (m - 2)C 3 H 8 (5) 

As the atomization energy of ethane is very well established both by W4 theory and by 
ATcT, it is highly desirable to also have a W4 reference value for propane. A component 
breakdown of the W4 atomization energies for methane, ethane, and propane is given in 
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Table [H while the final atomization energies at K are compared with ATcT and CCCBDB 
experimental values in Table [U There is good agreement between W4 and the available 
ATcT values (to within the sum of the uncertainties). 

W3.21ite component breakdowns from methane, ethane, propane, n-butane, and n- 
pentane (as well as for the relevant hypohomodesmotic reactions) are given in Table [III 
Let us briefly consider the raw components of the W3.21ite atomization energies. First, 
we note that there is a perfect linear relationship (R 2 >0.997) between all the components 
of the atomization energies and the number of carbons in the linear alkanes. The T 3 — (T) 
contribution reduces the atomization energies by amounts ranging form 0.12 kcal/mol in 
methane to 1.07 kcal/mol in n-pentane, and the (Q) contribution, which increases the 
atomization energy, ranges from 0.09 kcal/mol in methane to 0.79 kcal/mol in n-pentane. 
The overall post-CCSD(T) contributions, which increase linearly with the size of the system, 
reduce the atomization energies by 0.03-0.3 kcal/mol. We finally note that, while the directly 
computed W3.21ite total atomization energies in the hypothetical motionless state ("at the 
bottom of the well") agree well with the available W4 data (to within 0.2 kcal/mol or better), 
agreement at K is rather less pleasing (differences of 0.3, 0.6, and 0.7 kcal/mol are seen 
for methane, ethane, and propane, respectively). The chief part of these differences (0.2, 
0.4, and 0.5 kcal/mol, respectively) comes from the comparatively low-level approximation 
used for the zero-point vibrational energies: in molecules like n-pentane, ZPVE reaches 
the 100 kcal/mol regime, and even a 1% error due to neglect of explicit anharmonicity 
will translate into a 1 kcal/mol error in the final ZPVE. We made the point earlier 2| 
that, at least for species containing many hydrogens, the factor limiting accuracy of Wn 
and similar thermochemical protocols will increasingly be the quality of the zero-point 
vibrational energy. Nevertheless, for the ' quasi- W4' data obtained in the present paper 
from hypohomodesmotic or isodesmic cycles, the ZPVEs that enter the final results are the 
very accurately known ZPVEs of methane, ethane, and propane on the one hand, and the 
small ZPVE component (nearly two orders of magnitude smaller than for the brute-force 
TAE calculation) of the hypohomodesmotic or isodesmic reaction energy on the other hand. 
The relatively low- level approximation to the latter simply cannot wreak as much " damage" 
in the latter case. 

Turning to the hypohomodesmotic reactions of n-butane and n-pentane, the most 
striking feature of Table [III is the near-perfect cancellation of all the valence correlation 
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contributions between reactants and products. For n-butane, the CCSD, (T), T— (T), and 
(Q) contributions to the hypohomodesmotic reaction energy amount to merely, -0.03, - 
0.03, -0.01, and 0.00 kcal/mol, respectively. While for the hypohomodesmotic reaction 
involving n-pentane they amount to: +0.02, -0.02, -0.03, and +0.01 kcal/mol, respectively. 
The scalar relativistic and DBOC contributions are, likewise, basically null. The dominant 
contributions to the reaction energies come from the SCF, inner-shell, and ZPVE components 
(specifically, +0.20, +0.07, and +0.13 kcal/mol, respectively, for the n-butane reaction, and 
+0.21, +0.11, and +0.22 kcal/mol, respectively, for the n-pentane reaction). Overall, these 
hypohomodesmotic reactions are very slightly endothermic; the reaction energies at K are 
0.32 and 0.52 kcal/mol for n-butane and n-pentane, respectively. 

Having established that valence post-CCSD(T) and DBOC contributions to the 
hypohomodesmotic reaction energies are thermochemically negligible, we proceed to 
calculate the atomization energies of larger n-alkanes from Wlh hypohomodesmotic reaction 
energies. Wlh component breakdowns from linear alkanes up to n-octane (as well as for 
the relevant hypohomodesmotic reactions) are gathered in Table S2 of the Supporting 
Information. First, let us consider the components of the atomization energies for the 
five species for which a comparison with W3.21ite can be made. The SCF component, 
extrapolated from the cc-pV{T,Q}Z basis sets, is well converged. The valence CCSD 
contribution, extrapolated from the same basis sets, systematically overestimates the cc- 
pV{Q,5}Z results (by 0.4, 0.5, 0.7, 0.9, and 1.1 kcal/mol for methane, ethane, propane, 
n-butane, and n-pentane, respectively). These differences are partially compensated 
by the fact that the Wlh inner-shell contribution calculated with the MTsmall basis 
set systematically underestimates the cc-pwCV{T,Q}Z results (by ~0.1 kcal/mol for 
methane and ethane, and by ~0.2 kcal/mol for the larger alkanes). The valence (T) 
component, extrapolated from the cc-pV{D,T}Z basis sets, is reasonably converged — the 
largest deviation (of 0.1 kcal/mol) from the cc-pV{T,Q}Z results is seen for neopentane. 
Overall, the Wlh atomization energies at K overestimate the W3.21ite values by 0.2, 
0.3, 0.4, 0.6, and 0.7 kcal/mol for methane, ethane, propane, n-butane, and n-pentane, 
respectively. This demonstrates the limitations of directly computing TAEs with lower level 
compound thermochemistry methods such as Wlh without the aid of hypohomodesmotic or 
isodesmic cycles. (We note that most of the difference comes from basis set incompleteness 
in the Wlh valence CCSD contribution, and that its nefarious effects is actually mitigated 
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by error compensation with neglect of DBOC and post-CCSD(T) components. By way 
of illustration, the W2.2h numbers for n-butane and n-pentane are 1220.71 and 1497.75 
kcal/mol, respectively, very close to W3.21ite.) 

Turning to the Wlh components of the hypohomodesmotic reaction energies (Table S2, 
Supporting Information) the SCF, valence CCSD, valence (T), and inner-shell contributions 
to the hypohomodesmotic reaction energies are very similar to their counterparts at the 
W3.21ite level. The overall reaction energies at K differ by merely 0.02 and 0.03 kcal/mol for 
the n-butane and n-pentane hypohomodesmotic reactions, indicating that the basis sets used 
in Wlh are sufficiently large to ensure adequate convergence of these hypohomodesmotic 
reaction energies. For the larger n-alkanes, again we see that the valence (T) contribution 
practically cancels out between reactants and products. The valence CCSD contribution to 
the reaction energy becomes thermochemically significant for the larger n-alkanes, reaching 
0.30 kcal/mol for the n-octane reaction. The overall reaction energies at K range from 
0.34 to 1.50 kcal/mol for the n-butane and n-octane reactions, respectively. 

Using W4 atomization energies for ethane and propane and assuming that the 
hypohomodesmotic reaction energies stay constant between Wlh and W4 theories, we 
obtain 'quasi- W4' atomization energies. Table II I II compares the 'quasi- W4' atomization 



energies obtained 
from CCCBDB 



rom W3.21ite and Wlh reaction energies with experimental data taken 
4=51 ] (adjusted for the revised, ATcT, heat of formation of carbon atom). 
First of all, for the species where such a comparison can be made, the ' quasi- W4' values 
obtained from W3.21ite and Wlh reaction energies are in very close agreement with each 
other. The hypohomodesmotic 'quasi- W4' and adjusted CCCBDB values agree very well 
(0.06-0.32 kcal/mol of one another, where the largest deviations are seen for n-pentane). 

Finally, we note that the Wlh TAE increasingly overestimate the 'quasi-W4' TAE 
(by 1.3-2.0 kcal/mol) on going from n-butane to n-octane, and even the W3.21ite 
TAEo for n-butane and n-pentane overestimate the 'quasi-W4' TAEo by 0.7 and 0.8 
kcal/mol, respectively. These large differences demonstrate the obvious advantage of using 
hypohomosesmotic reactions rather than atomization reactions when one is limited to 
comparatively low-level compound thermochemistry methods such as Wlh. The difference 
between the W3.21ite and best values mostly derives from neglect of explicit anharmonicity 
in the ZPVE. (The use of scale factors, of course, to some degree accounts implicitly for 
anharmonicity. ) 
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C. Branched alkanes 

In the nature of things, for an arbitrary alkane, there are more isodesmic reactions 
involving small hydrocarbon prototypes then there are hypohomodesmotic ones, rendering 
the former more useful for thermochemical applications. If we consider only prototypes for 
which we have explicit W4 reference values (namely, methane, ethane, and propane) then 
for an arbitrary alkane two linearly-independent isodesmic reactions are: reaction El and the 
bond-separation reaction El 



Table [IT] gives W3.21ite component breakdowns from isobutane and neopentane (as well 
as for the said isodesmic reaction energies). We note that for both isodesmic reactions the 
post-CCSD(T) contributions are not only fairly small, but partly cancel out between higher- 
order triples and connected quadruples, something more pronounced for reaction [5] than for 
reaction El 

Table S2 of the Supporting Information gives Wlh total atomization energies component 
breakdowns from branched alkanes of up to eight carbons and Table S3 of the Supporting 
Information gives Wlh component breakdowns from the said isodesmic reaction energies. 
As was the case for the hypohomodesmotic reactions, the Wlh components of reaction 
[5] are practically identical to their counterparts at the W3.21ite level, indicating that the 
basis sets used in Wlh theory are large enough to ensure convergence of the components 
of this isodemic reaction. The components of the bond-separation reaction El are not as 
similar to their counterparts at the W3.21ite level — the largest deviations are seen for the 
CCSD component (namely, 0.14 and 0.20 kcal/mol for the reactions involving isobutane and 
neopentane). Again we find that the DBOC contributions, which are neglected in Wlh, are 
fairly small, something more pronounced for reaction [5] than for reaction El 

Perusing the energy components of reactions El and El in Table S3 (see Supporting 
Information), several systematic features emerge: (a) the valence CCSD and (T) components 
of reaction [5] are substantially lower than those of reaction |6} (b) inner-shell correlation 
effects are somewhat more pronounced in reaction [5] than in reaction El (c) scalar relativistic 
and DBOC contributions are fairly small, but in contrast to reaction El in reaction El they 
systematically increase with the size of the alkane; and (e) the ZPVE contribution to reaction 
El is substantially lower than to reaction El 




(6) 
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In effect, the energy components (except for inner-shell) of reaction [5] cancel out to a larger 
extent between reactants and products than in the bond-separation reaction [HI Overall, the 
reaction energies at K for reaction [5] are considerably lower (by 6-17 kcal/mol) than those 
of reaction EJ The superiority of reaction [5] is ascribed to the fact that the numbers of CH2 
and CH3 groups are roughly equal on both sides of the reaction (with the notable exception 
of hexamethylethane, a.k.a. 2,2,3,3-tetramethylbutane), while in reaction [U] the difference 
in CH 2 and CH 3 groups between reactants and products increases systematically with the 
size of the alkane. Furthermore, in reaction [3] the number of 1,3-interactions is roughly 
equal on both sides, while in reaction [6] 1,3-interactions occur only on the left-hand-side, 
and thus the imbalance increases systematically with the size of the alkane. The difference 
between the two reactions is further emphasized when considering the linear alkanes for 
which reaction is hypohomodesmotic, and thus the numbers of CH 2 groups, CH 3 groups, 
and 1,3-interactions are perfectly balanced on both sides of the reaction. For example, for n- 
octane, the SCF, CCSD, and (T) components of reaction [6] are 4.79, 6.87, and 1.56 kcal/mol, 
while for reaction [5] they are merely 0.27, 0.30, and 0.03 kcal/mol, respectively (see Table 
S2 of the Supporting Information). 

From the linearly independent reactions [5] and [6] we can easily construct new isodesmic 
reactions — any linear combination of isodesmic reactions is also isodesmic — in which the 
number of 1,3-interactions, CH 2 groups, or CH 3 groups are perfectly balanced on both 
sides of the reaction. In the remainder of the paper, these are referred to as (1,3), (CH2), 
and (CH3) reactions, respectively. The question that naturally arises is: which of the 
five isodemic reactions should be used to derive the ' quasi- W4' atomization energy of 
the branched alkanes? As far as the electronic structure is concerned, we can divide the 
question into two parts: convergence of the n-particle space ("correlation treatment") and 
one-particle space ("basis set"). (In the present study we are not analyzing errors arising 
from anharmonic corrections to the ZPVE.) 

We have already seen for isobutane and neopentane (Table [II]) that convergence of the 
n-particle space is faster in reaction [5] than in reaction [6j Table [III also shows that the 
isodesmic reaction that balances the 1,3-interactions exhibits the fastest convergence, i.e., 
the T 3 — (T) and (Q) contributions are 0.00 and 0.00 kcal/mol for the isobutane reaction, 
and -0.01 and -0.02 kcal/mol for the neopentane reaction. 

Convergence of the one-particle space can be explored in a more systematic manner. 
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Table S4 (given in the Supporting Information) shows the basis set convergence of the 
SCF, CCSD, and (T) components of the said isodesmic reactions. In general, the SCF 
component converges quite rapidly with the basis set size, for all the isodesmic reactions 
the cc-pV{D,T}Z results are close to the basis set limit, something more pronounced for 
reactions [5] and (1,3) (we note that reaction [6] exhibits anomalous convergence for some of 
the systems). Convergence of the valence CCSD component becomes markedly slower. For 
most of the systems the isodesmic reactions E] and (1,3) the cc-pVTZ basis set yields results 
close to the basis set limit; nevertheless, they exhibit anomalous, nonmonotonous, basis set 
convergence, the cc-pV{D,T}Z results being further away from the basis set limit. As for 
the valence (T) component, again, these two reactions converge more rapidly, i.e., for most 
systems convergence is obtained even with the cc-pVDZ basis set. 

Table S3 of the Supporting Information gives Wlh component breakdowns from the five 
isodesmic reaction energies. Several systematic trends are observed: (a) for any branched 
alkane the reaction energies both in the hypothetical motionless state ("at the bottom of 
the well") and at K increase in the following order: RE[(1,3)] < RE[5] < RE[(CH 3 )] 
< RE[(CH 2 )] < RE®; (b) the SCF, valence CCSD, valence (T), relativistic, and DBOC 
reaction components, generally, increase in the same order; (c) the inner-shell and ZPVE 
corrections decrease in the same order. 

From the above discussion, it seems that reactions Eland (1,3) offer the greatest similarity 
between the electronic structure of the reactants and products. The former has the additional 
advantage that it results in atomization energies with smaller uncertainties (Table S3, 
Supporting Information). Therefore, we select it as the 'best' reaction for deriving the 
most accurate ' quasi- W4' atomization energies. 

Table S3 (see Supporting Information) gives the TAE of the branched alkanes obtained 
from the five said isodesmic reactions (by assuming that the isodesmic reaction energy stay 
constant at the Wlh and W4 levels, and using the W4 TAEo for methane, ethane and 
propane). In practice, the resulting TAEo vary by 0.3-3.2 kcal/mol depending on which 
reaction is used and the standard deviation varies between 0.2 and 1.1 kcal/mol. It is 
interesting to note, however, that for any alkane the TAE are ordered in the same way as 
the reaction energies, namely, TAE[B] < TAE[(CH 2 )] < TAE[(CH 3 )] < TAE0 < TAE[(1,3)]. 

Table II I II compares the final ' quasi- W4' atomization energies obtained from isodesmic 
reaction [5] with experimental data taken from the CCCBDB 45J: these were adjusted for the 



15 



revised (ATcT) heat of formation of carbon atomjsjj. First, the 'quasi- W4' TAE obtained 
from W3.21ite and Wlh reaction energies are in close agreement with each other. The 
isodesmic 'quasi- W4' and adjusted CCCBDB values agree very well (to within 0.1-0.3 
kcal/mol) for all the systems but neopentane. The discrepancy of 1.1 kcal/mol seen for 
neopentane is too large to be easily explainable in terms of issues with the calculations. We 
note, however, that a discrepancy of 0.7 kcal/mol exists between our best calculated H 2 98— H 



and the value that Scott [58| used in TRC and propagated into CCCBDB; moreover, the 
computed and observed AH^ 298K are in much more plausible agreement. (We also note 
that neopentane does not have conformers and therefore these cannot introduce uncertainty 
in H298— Ho- In addition, the Ayala-Schlegel 49(] internal rotation correction is an order of 
magnitude smaller than the discrepancy.) We would argue that the experimental data for 
neopentane bear reexamination. 

Table IIHI also compares the final ' quasi- W4' heats of formation at 298 K with the 
available experimental data. There is reasonable agreement between theory and experiment 
(generally, to within 0.0-0.4 kcal/mol). In general, the available TRC values are 0.1-0.3 
kcal/mol higher than the 'quasi-W4' values, and the Rossini values are 0.0-0.4 kcal/mol 
higher than the theoretical values. 

The equilibrium geometry of isooctane (a.k.a. 2,2,4-trimethylpentane, the "100%" 
fixpoint of the "octane rating" scale) has no symmetry: for this reason we were only able 
to obtain a Wlh value for the first-order saddle point (which has C s symmetry) possessing 
an imaginary frequency (37.5i cm -1 ) that corresponds to an internal rotation. The Wlh 
TAE e for the idealized structure (Table S2, Supporting Information) is 2483.28 kcal/mol. 
For the deformation energy difference between C\ isooctane and the C s saddle point we 
obtain 0.25 kcal/mol at the CCSD(T) limit (0.135, 0.100, and 0.015 kcal/mol from the SCF, 
valence CCSD, and valence (T) components, respectively.) Assuming that the core- valence 
and relativistic contributions to the deformation energy will be zero, we obtain an estimated 
Wlh TAE e of 2483.53 kcal/mol for the equilibrium structure of isooctane. Inclusion of the 
ZPVE from a scaled B3LYP/pc2 harmonic calculation (150.69 kcal/mol) results in a Wlh 
TAE of 2332.84 kcal/mol. Using the reaction energy of reaction at the Wlh level, and 
W4 atomization energies for ethane and propane, we obtain a 'quasi- W4' atomization energy 
for isooctane of 2330.67 kcal/mol, in reasonable agreement with the NIST value of 2330.94 
kcal/mol (adjusted for the revised, ATcT, heat of formation of carbon atom). 
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D. Performance of compound thermochemistry methods for alkanes 



Table IIVI presents root mean square deviations (RMSD) for atomization energies 
relative to our best values for more approximate compound thermochemistry methods 
such as G2(MP2) 



APNO 



59), G2|60|, G3|61), G3B3 62], G4 63j, G4(MP2)|64j, CBS-QB3|65|], CBS- 
6j|, Wlhfl, W2.2h, and W3.21ite. Application of the more expensive W2.2h and 



W3.21ite methods was possible only for a subset of small systems. Starting with the zero- 
point exclusive ('bottom of the well') data, the performance of the empirically corrected Gn 
methods systematically improves as one proceeds along the series (the RMSDs are: Gl 11.2, 
G2 4.3, G3 3.7, and G4 1.3 kcal/mol). G2(MP2) performs significantly worse (RMSD=5.9) 
than the standard G2 procedure; interestingly, standard G4 offers no improvement over 
G4(MP2). We note that, while G2(MP2) does not include post-MP2 correlation effects at 
all, the somewhat confusingly named G4(MP2) does include a small basis set CCSD(T) 
step: the main differences with full G4 theory are the absence (vs. presence) of an explicit 
inner-shell correlation step and of valence MP4 steps with some intermediate-sized basis 
sets. As in the n-alkanes the inner-shell correlation term scales quite linearly with n, this is 
an optimal scenario for absorbing its effect into the empirical 'high-level correction', while 
the systems are also sufficiently dominated by dynamical correlation (as well as apolar) that 
the MPn series converges well and a single CCSD(T)/6-31G* step can adequately handle 
post-MP2 correlation effects. 

Wlh gives a RMSD of 0.9 kcal/mol for the whole set and 0.7 kcal/mol for the subset of 
small systems. Using more elaborate basis sets for the extrapolations of the SCF, CCSD, 
and (T) contributions in W2.2h cuts the RMSD for the smaller subset to 0.3 kcal/mol. 
Including post-CCSD(T) correlation effects in W3.21ite further reduces the RMSD to 0.2 
kcal/mol. For the empirical methods the RMSD for the isomerization energies are lower 
than for the atomization energies: all the methods show similar performance with RMSD of 
0.7-1.2 kcal/mol. Interestingly, while G4 outperforms both CBS-QB3 and CBS-APNO for 
the 'bottom of the well' atomization energies, CBS-QB3 and CBS-APNO both surpass G4's 
performance for the isomerization energies. 

We have already stressed that for molecules containing many hydrogens the principal 
factor limiting the accuracy of Wn methods lies in the quality of the zero-point vibrational 
energy. This is in accord with the deterioration in performance of the Wn methods when 
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zero-point corrections are included, the RMSD increase by 0.6 kcal/mol for Wlh, W2.2h, 
and W3.21ite compared to the zero-point exclusive RMSD. Despite that both Gn and Wn 
methods use ZPVEs at relatively low levels of theory (scaled HF/6-31G* and B3LYP/pc- 
2 harmonic frequencies, respectively) the performance of the Gn methods significantly 
improves due to the fact that these methods were parametrized against experimental 
atomization energies at K. Thus, some correction for the zero-point energy is evidently 
absorbed in the empirical corrections. What's more, the systematic convergence of the Gn 
methods seen for the zero-point exclusive results is not observed for the zero-point inclusive 
results: G3 shows the best performance with a RMSD of 0.5 kcal/mol, while G2 and G4 
give RMSDs of 0.8 and 0.7 kcal/mol, respectively. Again we see that G4 and G4(MP2) show 
similar performance and that G2(MP2) performs substantially worse than G2. 



E. Performance of density functional theory for alkanes 



Recent studies js 



n 



„ [Ua] have shown that DFT methods fail to adequately 
, and isodesmic [13J reaction energies involving 
alkanes. Grimme [ri| showed that DFT methods generally predict the wrong sign for the 
n-octane— ^hexamethylethane isomerization, with errors ranging from 5.4 (B2-PLYP) to 11.8 
(BLYP). Schleyer and coworkers [13I] investigated the performance of various DFT functionals 
in reproducing the experimental reaction energy of the isodesmic react ion [6] for linear alkanes 
of up to n-decane. They showed that van der Waals corrected DFT functionals (such 
as MPWB1K and MPW1B95) underestimate the protobranching stabilization energy as 
defined by reaction [U] by ~1 kcal/mol, and other functionals by up to ~2 kcal/mol, per 
protobranch. 

In the present section we compare the relative performance of different DFT exchange- 
correlation functionals in predicting atomization, isomerization, and isodesmic reaction 
energies. The XC functionals employed include the following classes (numbered by 'run g' on 
the Perdew ladder [67]): (1) the local density approximation (LDA), specifically, SVWN5 
(2) the pure generalized gradient approximation (GGA) functionals: BLYP 22j, |69j , PBE 
and HCTH407[7li; (3) the meta GGAs: TPSS[72| and M06-l|z3|; (4a) the hybrid GGAs 
(which one might term 'imperfect fourth-rung functionals'): B3PW9100I, BSLYpQE, 



68]; 

70], 



n 



n 



24j, B97-l[75j, PBE0l76fl, B97-2[77j, B97-3l78|, and X3IYP179J]; (4b) the hybrid meta-GGAs: 
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B1B95(80|, TPSSh|8l|, BMK|82j, TPSS1KCIS Jgj, PW6B950, MO60, M06-2XJ85J; and 
(5) the double hybrid functional: B2-PLYP86[ ) mPW2-PLYP[87j, B2T-PLYp|m|, B2K- 



PLYP 5l|, a nd B2GP-PLYP 54|. For the conventional functionals we use the pc-2 basis set 



of Jensen 



251 ]. which is of [As3p2dlf] quality but optimized for Hartree-Fock and DFT, while 



for MP2 and the double hybrids which exhibit slower basis set convergence we also use the 
pc-3 basis set. 

To ensure we are "comparing apples to apples", so to speak, secondary effects that are 
not explicitly included in the DFT calculations such as relativity, deviations from the Born- 
Oppenheimer approximation, and zero-point vibrational corrections are excluded from the 
reference values. For methane, ethane, and propane we use nonrelativistic, clamped-nuclei, 
zero-point exclusive TAEs from W4 theory, and for the remaining systems our best 'quasi- 
W4' values (given in the second or third column of Table ITTT|) . The B3LYP/pc-2 reference 
geometries, the said reference TAEs, and the individual errors of the various functionals can 
be found in the Supporting Information. The empirical sq scaling factors for each functional 



were taken from ref. [52, 
B97-1, B97-2, B97-3, TPSSh, 
procedure as detailed in Ref. 



54j or otherwise for SVWN5, PBE, HCTH407, BLYP, TPSS, 
P SS1KCIS, PW6B95, and B1B95 optimized using the same 



54| . The optimized Sq values, as well as error statistics over the 
S22 benchmark set of weakly interacting systems by Hobza and coworkers 55J] , can be found 
in Table S5 of the Supporting Information. The RMSD, mean signed deviations (MSD), 
and mean average deviations (MAD) for the atomization reactions are gathered in Table PVT. 
Table IVIl lists the RMSD, MSD, and MAD for the n-alkane— s>branched-alkane isomerization 
reactions with and without dispersion corrections. Finally, we consider the performance 
of DFT for the isodesmic reaction [6j Table IVHI reports the RMSD for the n-, iso-, and 
neo-alkanes as well as for the entire set. 

Before proceeding to a detailed discussion of the specific performance of the various DFT 
functionals for our four evaluation sets, a few general remarks are in order: 

• The S22 benchmark set for weak interactions contains 22 model complexes involving 
typical noncovalent interactions, such as hydrogen bonding (e.g., H2O and NH3 
dimers), dipolar and multipolar dispersion interactions (e.g., pyrazine and benzene 
dimers) . We have previously 54| made the point that small sq values can be seen as an 
indication of a functional's ability to cope with dispersion. This is demonstrated by 
an almost perfect linear correlation between the magnitude of the s 6 values and the 
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uncorrected RMSD over the S22 set seen in Table S5 (see Supporting Information). 
Excluding the M06 family of functionals (M06, M06-L, and M06-2X) and the double 
hybrids (B2K-PLYP, B2GP-PLYP, B2-PLYP, and mPW2-PLYP) all the functionals 
perform rather poorly without the dispersion correction: RMSD vary between 3- 
6 kcal/mol. Correcting for dispersion dramatically reduces the RMSD to 0.3-1.0 
kcal/mol. For comparison, Hartree-Fock gives a RMSD of 0.8 kcal/mol after correcting 
for dispersion. 

• Of our four evaluation sets, the atomization energies are clearly the toughest nut to 
crack. In general, a posteriori correction for dispersion are essential for an accurate 
estimation of the atomization energies. As expected, the dispersion corrections increase 
with the number of carbons in the alkane, and for isomers with the number of 1,3- 
interactions present. Nevertheless, the magnitude of the corrections are somewhat 
surprising: for instance, with sq— 1.0, the dispersion energy correction ranges from 0.6 
(methane) to 27. 0(!) kcal/mol (hexamethylethane). The latter amounts to 1% of the 
total atomization energy. 

• Without the dispersion correction the RMSD for the n-alkane^branched-alkane 
isomerization reactions (Table IVT1) ranges from 0.2 kcal/mol (M06-2X) to 5.4 kcal/mol 
(HCTH407). Inspection of the individual errors reveals that most functionals predict 
the wrong sign for the isomerization reactions of the larger alkanes: n-hexane, n- 
heptane, and n-octane (i.e., that the linear isomer is more stable than the branched 
isomer). MP2, SVWN5 (!), and the M06 family of functionals (M06, M06-L, and 
M06-2X) are the only ones to obtain the right sign across the board. The corrected 
results are much more encouraging: without exception, all the functionals predict that 
the branched isomers are more stable than their linear counterparts, and for most of 
the functionals the overall RMSD is below 0.5 kcal/mol, specifically X3LYP, B97-3, 
M06-2X, B2K-PLYP, and B2GP-PLYP with a RMSD below 1 kJ/mol. 

• As for the performance of DFT for the isodesmic reaction [6] (Table IVTIj) . we find 
that all functionals other than LDA underestimate the reaction energy (whether 
it involves linear or branched alkanes), confirming the findings of Schleyer and 
coworkers fl^ for linear alkanes. Also we find that without correction for dispersion 
the RMSD increases with the number of 1,3-interactions present, i.e., in the order 



20 



n-alkanes<isoalkanes<neoalkanes. This trend is of course attributed to the increase 
in dispersion interactions with the degree of branching. 

We shall start with the nonempirical functionals SVWN5 LDA, PBE GGA, and TPSS 
meta-GGA functionals and their corresponding hybrid functionals PBEO and TPSSh. The 
SVWN5 functional fails miserably for the alkane atomization energies, with a RMSD of 
over 200 kcal/mol, and will be omitted from further discussion, other than to say that it 
performs surprisingly well for the isomerization and isodesmic reactions. For the S22 set 
the nonempirical functionals perform rather poorly with uncorrected RMSD between 4-5 
kcal/mol and corrected RMSD between 0.7-1.0 kcal/mol. For the atomization reactions 
(Table EJ the nonempirical functionals TPSS, PBE, TPSSh, and to a lesser extent PBEO 
systematically overbind the alkanes (and in fact, are the only functionals other than LDA 
that do so) as evident from MSD^MAD. Obviously, there is no point in 'correcting' for 
dispersion then, as it can only increase the errors further. Indeed, if the empirical Sq scaling 
factors are reoptimized by minimizing the RMSD for the atomization energies, then negative 
(anomalous) Sq values are obtained. The uncorrected RMSD for the atomization reactions 
are 4.2, 13.7, 3.7, and 1.7 kcal/mol for TPSS, PBE, TPSSh, and PBEO, respectively (we note 
that without the dispersion correction PBEO exhibits the best performance of the functionals 
considered). For the linear— ^-branched isomerization reactions the uncorrected RMSD are on 
the order of 3 kcal/mol and the corrected RMSD vary between 0.3-0.9 kcal/mol, where PBEO 
and PBE are the best performers. For the isodesmic reaction El the uncorrected RMSD vary 
between 5-7 kcal/mol and the corrected RMSD between 0.4-1.3 kcal/mol, again with PBE 
and PBEO as best performers. 

In the following discussion the empirical functionals are conveniently divided into three 
categories: lightly, moderately, and heavily parameterized. The GGA BLYP, hybrid GGA 
B3PW91 and B3LYP, and hybrid meta GGA TPSS1KCIS and B1B95 functionals belong to 
the first category. The uncorrected RMSD over our four validation sets (S22, atomization, 
isomerization, and isodesmic reactions) are unacceptably large ranging from 2-37 kcal/mol. 
After applying the dispersion correction, B3LYP emerges as the best performer with RMSD 
of 0.7, 1.9, 0.5, and 0.6 kcal/mol for the said four validations sets, respectively, followed 
by B1B95 with RMSD of 0.5, 2.5, 0.8, and 0.9 kcal/mol, respectively. We note that 
BLYP, B3PW91 and TPSS1KCIS yield comparable RMSDs for the S22, isomerization, and 
isodesmic reactions, but the former grossly underestimates the atomization energies and the 
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latter two largely overestimate the atomization energies. 

The X3LYP and PW6B95 functionals may be regarded as belonging to the second 
category. Again the RMSD before correcting for dispersion are unacceptably large (2- 
13 kcal/mol). After correcting for dispersion both functionals perform relatively well, 
particularly PW6B95 with RMSD of 0.5, 0.2, 0.3, and 1.6 kcal/mol for the S22, atomization, 
isomerization, and isodesmic reactions, respectively. 

The heavily parametrized functionals include the GGA HCTH407, meta GGA M06-L, 
hybrid GGAs B97-1, B97-2, B97-3, and hybrid meta GGAs BMK, M06, and M06-2X. The 
M06 family of functionals (M06-L, M06, and M06-2X) have exceptionally low sq values 
(0.20, 0.25, and 0.06, respectively). Without the dispersion correction M06-2X performs 
relatively well for the S22 and isomerization reactions, but grossly underestimates the 
atomization energies. All the other functionals yield unacceptably large errors without 
the dispersion correction. After correcting for dispersion, BMK and B97-2 seem to offer the 
best performance with BMK RMSDs of {0.6, 1.9, 1.3, 0.6} kcal/mol and B97-2 RMSDs of 
{0.5, 2.6, 0.4, 0.6} kcal/mol for the S22, atomization, isomerization, and isodesmic datasets. 

Finally, we come to the double-hybrid class of functionals (which may be considered 
lightly parameterized). It is perhaps not surprising that (for the BLYP-based double hybrids) 
the Sq values decrease with the percentage of MP2 correlation included. Furthermore, 
the performance of the double hybrides (over the four evaluation test sets) systematically 
improves with the percentage of MP2 correlation included: most notably for the atomization 
energies RMSD of 12.5, 8.7, 8.0, 5.5, and 4.2 kcal/mol are obtained with the mPW2-PLYP 
(25% MP2), B2-PLYP (27% MP2), B2T-PLYP (31% MP2), B2GP-PLYP (36% MP2), and 
B2K-PLYP (42% MP2) functionals, respectively. (For this narrow application, the original 



'thermochemistry and H-transfer barriers' parametrized double hybrid B2K-PLYP|5jJ thus 
slightly outperforms the more 'general-purpose' parametrized B2GP-PLYP 54j.) Similar 
trends are seen for the other three evaluation sets. After correction for dispersion, the 
performance of the double hybrids is quite remarkable: for the four validation sets (S22, 
atomization, isomerization, and isodesmic reactions), B2GP-BLYP yields RMSD of 0.4, 0.2, 
0.2, and 0.5 kcal/mol, respectively, while B2K-PLYP slightly bests it for the isomerization 
energies and slightly underperforms it for the isodesmic reactions (0.1 kcal/mol). We note 
that B2T-BLYP and B2-PLYP yield similar RMSD for the S22, isomerization, and isodesmic 
reactions, but an RMSD of ~1.0 kcal/mol for the atomization reactions. 
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As a numerical experiment, we reoptimized the empirical Sq scaling factor to minimize 
the RMSD for the atomization reactions (Table IV]) . Obviously this approach also corrects 
for deficiencies other than dispersion, such as basis set incompleteness and/or limitations of 
the XC functional. Nevertheless, for PW6B95, B2GP-PLYP, and B2K-PLYP there was no 
significant change in the sq values or RMSD, indicating that (a) the original sq values are 
optimal; and (b) that these functionals do not exhibit any systematic errors. We note that 
for a few other functionals, namely X3LYP, B3LYP, BMK, B2T-PLYP, and B2-PLYP, the 
reoptimized values are very similar to the original ones but the RMSD is improved by 
0.6-1.2 kcal/mol. 

F. Hydrocarbon prototype reactions 

As discussed by Allen and coworkers [^J, the heat of formation at K (viz. 
the atomization energy) of an arbitrary acyclic hydrocarbon can be obtained from 
hypohomodesmotic reactions involving a very limited number of branching prototypes: 
methane, ethane, ethene, acetylene, propane, propene, propyne, allene, isobutane, 
neopentane, and isobutene. As the atomization energies of the first four species are very well 
established both by W4 theory and by ATcT, this leaves seven species for which it would 
be highly desirable to have highly accurate reference values. W4 component breakdowns 
from methane, ethane, ethene, acetylene, propane, propene, propyne, and allene are given 
in Table H The final W4 atomization energies (Table!]) are: 392.47, 666.18, 532.09, 388.70, 
942.95, 811.53, 670.45, and 669.37 kcal/mol, respectively. For isobutane, neopentane, and 
isobutene we obtain 'quasi- W4' TAE from isodesmic reaction energies at the W3.21evel 
by using reaction [5] for the former two and isobutene+CH4— s^h^+CaHg for the latter 
(component breakdowns from the said reactions are given in Table HT]) . Using W4 reference 
values for ethane, propane, and ethene we obtain atomization energies of 1221.73, 1501.01, 
and 1092.10 kcal/mol for isobutane, neopentane, and isobutene, respectively. 

IV. CONCLUSIONS 

Benchmark post-CCSD(T) atomization energies and heats of formation for C n H 2n+2 (all 
isomers up to n =6 inclusive, plus selected isomers for n =7 and n =8) have been obtained. 
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Excellent agreement with the best available experimental data has been observed except 
that an issue with the experimental enthalpy function of neopentane has been identified. 

The hypohomodesmotic reaction energy (eq. G3, in conjunction with linear alkanes) 
converges very rapidly with the level of theory. Valence post-CCSD correlation effects 
are thermochemically negligible (at least for the small alkanes considered in the present 
work). Basis set requirements are likewise rather modest the SCF component converges 
with the cc-pV{D,T}Z basis sets and CCSD component with the cc-pV{T,Q}Z basis sets. 
Scalar relativistic and DBOC contributions are thermochemically insignificant. Inner-shell 
correlation accounts for 25-30% of the reaction energy at the 'bottom of the well'. 

For isodesmic reactions convergence of the one- and n-particle spaces depends on the 
nature of the reaction: it is found that balancing the number of 1,3-ineractions between 
reactants and products significantly accelerates the convergence. 

We evaluated the performance of popular compound thermochemistry methods in 
reproducing the atomization energies. Post-CCSD(T) corrections are necessary for obtaining 
sub kJ/mol accuracy for the 'bottom of the well' atomization energies, as W3.21ite is the only 
method that achieves this goal. W2.2h and Wlh are capable of sub kcal/mol predictions; 
and G4, G4(MP2), and CBS-APNO afford ~1 kcal/mol accuracy. For organic systems 
containing many hydrogens the zero-point corrections may be considered the 'Achilles' 
Heel' of nonempirical thermochemical methods, since the error due to neglect of explicit 
anharmonicity is on the order of one kcal/mol. Accordingly, the RMSDs of the nonempirical 
Wn methods increase by 0.6 kcal/mol when zero-point corrections are included (specifically, 
the RMSDs are 1.7, 0.9, and 0.8 kcal/mol for Wlh, W2.2h, and W3.21ite, respectively). 
The performance of the empirical Gn methods (G2, G3B3, G3, G4(MP2), and G4), on the 
other hand, improves upon inclusion of ZPVE corrections (with RMSD ranging between 
0.4-0.6 kcal/mol), presumably due to the fact that their empirical correction terms were 
fitted against experimental atomization energies at K. 

The performance of different DFT exchange-correlation functionals in predicting 
atomization, isomerization and isodesmic reaction energies was evaluated. The atomization 
reactions are clearly the most daunting test; taking dispersion corrections into account three 
best performers emerge: PW6B95, B2K-PLYP, and B2GP-PLYP, all of which attain a 
near-zero RMSD of 0.2 kcal/mol. For the isodesmic reactions the three said best performers 
attain RMSD of 1.7, 0.6, and 0.5 kcal/mol, respectively. And for the isomerization reactions 
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(for which almost all the functionals perform exceptionally well) said best functionals attain 
RMSD of 0.3, 0.1, and 0.2 kcal/mol, respectively. 
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TABLE I: Component breakdown of the final W4 total atomization energies at the bottom of 
the well (TAE e ) and comparison between W4 total atomization energies at K (TAEq), Active 



Thermochemical Tables benchmarks, and CCCBDB reference data (in kcal/mol) a 





SCF 


CCSD 


(T) 


T 3 -(T) 


fi 


T 5 


inner 
shell 


relativ. 


spin 
orbit 


DBOC 


(a) 


M-A b 


TAE C 


ZPVE C 


TAE d 


ATcT d ' e 


CCCBDB^ 


CCCBDB' 


methane 


331.55 


84.72 


2.89 


-0.08 


0.08 


0.00 


1.27 


-0.19 


-0.08 


0.10 


-0.05 


0.02 


420.21 


27.74 


392.47±0.16 


392.50±0.02 


392.51±0.07 


392.4±0.1 


ethane 


558.01 


146.34 


6.37 


-0.35 


0.23 


0.01 


2.43 


-0.39 


-0.17 


0.14 


-0.08 


0.04 


712.58 


46.39 


666.18±0.16 


666.16±0.05 


666.25±0.11 


666.1±0.2 


ethene 


434.97 


119.32 


7.40 


-0.46 


0.43 


0.03 


2.38 


-0.33 


-0.17 


0.12 


-0.07 


0.04 


563.64 


31.60 


532.04±0.16 


532.00±0.05 


532.06±0.13 


531.9±0.2 


acetylene 


299.87 


94.73 


8.35 


-0.72 


0.70 


0.07 


2.49 


-0.28 


-0.17 


0.12 


-0.05 


0.04 


405.13 


16.46 


388.67±0.16 


388.67±0.05 


388.90±0.20 


388.7±0.3 


propane 


785.34 


209.05 


10.12 


-0.63 


0.38' 1 


N/A 


3.61 


-0.58 


-0.25 


0.20 


-0.11 


0.06 


1007.15 


64.20 


942.95±0.16 


942.78±0.10 


942.92±0.14 


942.7±0.3 


propene 


665.18 


181.71 


11.17 


-0.76 


0.60'' 


0.04 ! ' 


3.61 


-0.52 


-0.25 


0.18 


-0.09 


0.06 


860.89 


49.36 


811.53±0.16 


N/A 


811.67±0.27 


811.4±0.4 


propyne 


534.14 


155.80 


11.94 


-1.01 


0.87 


0.07 ; 


3.76 


-0.48 


-0.25 


0.17 


-0.07 


0.06 


704.97 


34.52 


670.45±0.16 


N/A 


670.91±0.22 


670.7±0.4 


allene 


532.55 


155.63 


12.27 


-0.84 


0.74 


0.05* 


3.67 


-0.47 


-0.25 


0.15 


-0.07 


0.06 


703.45 


34.08 


669.37±0.16 


N/A 


669.28±0.27 


669.1±0.4 



(a)ADBOC=DBOC[CCSD/cc-pVDZ]-DBOC[HF/cc-pVDZ] correction. 
a W4 values for methane, ethane, ethene, and acetylene are from ref. and for propyne and allene from ref. |54j (except for 
the ZPVE corrections which were calculated in the present work, see computational details section); W4h values for propane 
and propene are from the present work. Note that the atomization energies include a post-SCF contributions to the diagonal 
Born-Oppenheimer correction, cfr. footnote (a). 

^Difference between the ACES II and MOLPRO definitions of the valence ROCCSD(T), half of this contribution is added to 
the final TAE as discussed in the appendix of ref. \3. 

c Methane, ethane, ethene, and acetylene from ref [2| ; propane, propene, propyne, and allene this work (see computational 
details section). 

d The adjunct uncertainties correspond to 95% confidence intervals. 

e ATcT reference values for methane, ethane, ethene, acetylene, and propane are taken from refs. 

f Using ATcT atomic heats of formation at K in converting the CCCBDB Q| molecular heat of formation at K into an 
atomization energy (see computational details section). 

9 Using CODATA atomic heats of formation at K in converting the CCCBDB^sj molecular heat of formation at K into 
an atomization energy. 

''■Here f 4 ss l.l(E[CCSDT(Q)/cc-pVTZ(no d on H)] - £[CCSDT/cc-pVTZ(no d on H)] + 
£[CCSDTQ/cc-pVDZ(no p on H)] - E[CCSDT(Q)/cc-pVDZ(no p on H)]. 

{ The f 5 contribution approximated as CCSDTQ(5) A /cc-pVDZ(no d)-CCSDTQ/cc-pVDZ(no d). 
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TABLE II: Component breakdown of the final W3.21ite total atomization energies and from 



hypohomodesmotic and isodesmic reactions (in kcal/mol). 





SCF 


valence valence T 3 — (T) 


(Q) 


inner 


rclativ. 


spin 


DBOC 


M-A a 


TAE e 


ZPVE b 


TAE 






CCSD 


(T) 






shell 




orbit 












methane 


331.54 


84.76 


2.94 


-0.12 


0.09 


1.28 


-0.19 


-0.08 


0.10 


0.02 


420.32 


27.57 


392.75 


ethane 


558.00 


146.40 


6.43 


-0.33 


0.25 


2.45 


-0.39 


-0.17 


0.14 


0.04 


712.81 


45.98 


666.83 


propane 


785.21 


209.08 


10.21 


-0.56 


0.43 


3.59 


-0.58 


-0.25 


0.20 


0.06 


1007.35 


63.69 


943.67 


n-butane 


1012.62 


271.72 


13.96 


-0.82 


0.61 


4.81 


-0.77 


-0.34 


0.25 


0.08 


1302.08 


81.26 


1220.83 


n-pentane 


1239.85 


334.45 


17.75 


-1.07 


0.79 


5.99 


-0.96 


-0.44 


0.30 


0.10 


1596.73 


98.88 


1497.87 


isobutane 


1012.93 


272.63 


14.17 


-0.84 


0.62 


4.81 


-0.76 


-0.34 


0.26 


0.08 


1303.51 


81.01 


1222.51 


neopentane 


1240.03 


337.34 


18.41 


-1.16 


0.82 


5.97 


-0.95 


-0.42 


0.32 


0.10 


1600.41 


98.56 


1501.86 


isobutene 


895.34 


244.67 


15.09 


-0.89 


0.74 


4.92 


-0.71 


-0.34 


0.24 


0.08 


1159.10 


66.44 


1092.67 



Hypohomodesmotic reactions, eq. [5] 



n-butane 


0.20 


-0.03 


-0.03 


-0.01 


0.00 


0.07 


0.00 [( 


)] 0.00 [c 


] 0.19 


0.13 


0.32 


n-pentane 


0.21 


0.02 


-0.02 


-0.03 


0.01 


0.11 


0.00 [( 


)] -0.01 [c 


] 0.30 


0.22 


0.52 



Isodesmic reactions, eq. \E\ 



isobutane 




0.51 


0.88 


0.18 -0.03 0.01 0.07 


0.01 [( 


)] 0.00 [0 


1.62 


0.39 


2.01 


neopentane 




0.39 


2.90 


0.64 -0.11 0.04 0.09 


0.02 [( 


1] 0.02 [0 


3.99 


0.53 


4.52 


Isodesmic reactions, eq. \E\ 


isobutane 




2.00 


2.96 


0.75 -0.09 0.04 0.03 


0.02 [C 


] 0.03 [0 


5.73 


1.80 


7.53 


neopentane 




2.63 


6.03 


1.49 -0.20 0.08 0.03 


0.03 [( 


1] 0.05 [0 


10.15 


2.65 


12.80 




Isodesmic reactions balance the number of 1,3-interactions on 


both sides 








isobutane+3C2 H6 — 1 


CH 4 +3C 3 H 8 


-0.24 


-0.17 


-0.11 0.00 0.00 0.09 


0.00 [C 


1] -0.01 [0 


-0.43 


-0.32 


-0.76 


neopentane+8C2 H6 


^3CH 4 +6C 3 H 8 


-1.85 


-0.23 


-0.22 -0.02 -0.01 0.15 


0.00 [C 


] -0.02 [0 


-2.18 


-1.59 


-3.77 



Isodesmic reactions balance the number of CH3 groups on both sides 



2isobutane+CH4 - 


^3C 3 H 8 


1.76 


2.79 


0.64 


-0.10 


0.04 


0.12 


0.02 [( 


)] 0.02 [C 


] 5.30 


1.48 


6.78 


neopentane+CH4 


^2C 3 H 8 


1.14 


3.94 


0.92 


-0.14 


0.05 


0.07 


0.02 [( 


)] 0.03 [C 


] 6.04 


1.24 


7.28 


isobutene+CH 4 ^ 


C 2 H 4 +C 3 H 8 


6.47 


1.25 


0.46 


-0.09 


0.07 


0.15 


0.00 [( 


)] 0.03 [C 


] 8.34 


1.19 


9.53 



a The difference between the ACES II and MOLPRO definitions of the valence ROCCSD(T). 
f 'B3LYP/pc-2 harmonic frequencies scaled by 0.985 
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TABLE III: Total atomization energies at K and heats of formation at 298 K (in kcal/mol). 
'Quasi-W4' atomization energies are estimated from the hypohomodesmotic reaction eq. [5] for 



linear alkanes and from the isodesmic reaction eq. [5] for branched alkanes. 





TAE C ° 






TAE 






- H 




AH f,29S 






Quasi-W4 b 


Quasi- W4 C 


Wlh 


W3.2Ute Quasi-W4 i> Quasi-W4 c 


Expt. d 


Theor.° 


TRC™ 


'Quasi-W4' 


CCCBDB 


Webbook 






















TRC" 


Rossini e 


Other 


rc-butane 


1302.90 


1302.89 


1221.42 


1220.83 


1220.06 1220.04 


1220.10±0.20 


4.62 


4.61 


-30.00 


-30.06±0.17 


-30.37±0.16 


-30.03i0.169 


7i-pentane 


1597.84 


1597.81 


1498.61 


1497.87 


1497.04 1497.01 


1497.33±N/A 


5.70 


5.78 


-34.84 


-35.09 


-35.00±0.16 


-35.16i0.24 h 
-35.08±0.14 i 


n-hexane 


1892.79 




1775.91 




1774.15 


1774.28±N/A 


6.79 


6.86 


-39.84 


-39.89 


-39.96±0.19 


-39.94^ 


rv~ heptane 


2187.76 




2053.14 




2051.17 


2051.40i0.27 / 


7.87 


7.94 


-44.74 




-44.89±0.19 


-45.24* 


n- octane 


2482.72 




2330.46 




2328.30 


2328.45iN/A-f 


8.97 


9.03 


-49.71 




-49.82±0.16 


-49.88' 


isobutane 


1304.35 


1304.31 


1223.13 


1222.51 


1221.76 1221.73 


1221.98±0.17 


4.29 


4.30 


-32.01 


-32.27±0.14 


-32.42±0.13 


-32.07i0.159 


isopentane 


1598.74 




1499.83 




1498.27 


1498.48±N/A 


5.30 


5.26 


-36.49 


-36.74 


-36.92±0.20 


-36.84±0.23'' 
-36.73i0.14 1 


neopentane 


1601.53 


1601.46 


1502.63 


1501.86 


1501.06 1501.01 


1502.14±N/A 


4.84 


5.54 


-39.69 


-40.13 


-39.67±0.25 


-40.27i0.24'' 
-40.14i0.15 1 


isoliexane 


1893.67 




1 77R on 




1775.22 


lit O.OOZtlN / A 


6.28 


6.29 




-41.73 


-41.66±0.25 




3-methylpentane 


1893.02 




1776.38 




1774.61 


1774.86±N/A 


6.37 


6.23 


-40.72 


-41.11 


-41.02±0.23 




diisopropyl 


1893.87 




1777.43 




1775.67 


1775.41±N/A 


6.07 


5.85 


-42.08 


-42.04 


-42.49±0.24 




neohexane 


1895.28 




1779.04 




1777.27 


1777.49±N/A 


5.98 


6.01 


-43.77 


-43.98 


-44.35±0.23 




isoheptane 


2188.66 




2054.40 




2052.43 


2052.57iN/A-f 


7.44 


7.39 


-46.43 




-46.60±0.30 


-46.89 fc 


neoheptane 


2190.19 




2056.38 




2054.41 




7.01 


6.98 


-48.84 




-49.29±0.32 


-50.07'= 


hexamethylethane 


2484.09 




2332.98 




2330.81 




7.52 


7.53 


-53.68 




-53.99±0.46 


-54.06' 


isooctane 


2484.26 




2332.84 




2330.67 


2330.86i0.37-f 


7.77 


7.69 


-53.29 




-53.57±0.32 





a W4 zero-point exclusive, nonrelativistic, clamped-nuclei atomization energies for testing/paramctrization of DFT 
functionals. 'Using Wlh reaction energies and W4 values for ethane and propane. c Using W3.21ite reaction energies and W4 
values for ethane and propane. d n-octane and isooctane from ref. [cii! . all the rest from the experimental data section of ref. 
[45I ; using ATcT atomic heats of formation at K in converting the molecular heat of formation at K into an atomization 
energy (see computational details section). e n-butane and isobutane data were instead taken from E. J. Prosen, F. W. 
Maron, and F. D. Rossini, J. Res. NBS 46, 106 (1951), higher alkanes from E. J. Prosen and F. D. Rossini, J. Res. NBS 38, 
263 (1945). -f Heats of formation at 298 K were converted to K using the TRC enthalpy functions, J/298 — Ho, for the 
molecules (see table) and the CODATaJ^ enthalpy functions for the elemental reference states: 

#298 - J/o[H 2 (g)]=2.024±0.000, and H 2 gg - #o[C(cr,graphite)]=0.251±0.005 kcal/mol. »D. A. Pittam and G. Pilcher J. 
Chem. Soc. Faraday Trans. 68, 2224-2229 (J. Chem. Soc. Faraday Trans. 1, 1972, 68, 2224-2229). h G. Pilcher and J. D. M. 
Chadwick, Trans. Faraday Soc. 63, 2357-2361 (1967). l W. D. Good, J. Chem. Thermodyn. 2, 237-244 (1970). 'W. D. Good 
and N. K. Smith J. Chem Eng. Data 14, 102-106 (1969); liquid combined with Rossini vaporization enthalphy (D. R. 
Burgess). k G. F. Davis and E. C. Gilbert J. Am. Chem. Soc. 63, 2730-2732 (1941); liquid combined with Rossini 
vaporization enthalphy (D. R. Burgess). l W. D. Good, J. Chem. Thermodyn. 4, 709-714 (1972); liquid combined with 
Rossini vaporization enthalphy (D. R. Burgess). m TRC (Thermodynamic Research Center) database l4sil . n Ref. \3l All values 
except for n-butane appear to have been taken from Ref. [5^- For n- butane, the latter source lists 4.71 kcal/mol. °The 
translational, rotational, and vibrational contributions were obtained within the RRHO (rigid rotor-harmonic oscillator) 
approximation from the B3LYP/pc-2 calculated geometry and harmonic frequencies. Internal rotation corrections were 
obtained using the Ayala-Schle gel methodic . again on the B3LYP/pc-2 potential surface. Conformer corrections were taken 
from a recent benchmark study [501] - The specific data given here were obtained at the 
B2K-PLYP-D/pc-2//PW6B95/6-311G** level. See Computational Methods section for further details. 
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TABLE IV: Root mean square deviations (relative to our best values, in kcal/mol) of popular 
compound thermochemistry methods for the atomization and isomerization energies of the alkanes 
considered in the present work. 





2 


ero-point exclusive 


zero-point inclusive 




Large set 


1 Small set b 


Tcnni r i vi r 7Ci'ririn'- 

loUlllLl IZjOjLILJII 


T.nT'ov" 1 cot"' 




Tqcwyi nvwci'rirtt'i'-" 

loUlilLl LZjCX L1LJ11 


Gl 


11.21 


7 fi1 


1 1 Q 






n Q7 


G2(MP2) 


5.87 


o.yu 


117 
1.11 


2 24 


1 31 

l.Ol 


u.yu 


G2 


4.27 


9 £9 


1 l(i 
1 . ID 


n 77 


n A7 




G3B3 


3.16 


O 19 

z. ±o 




n 7^ 




U.OU 


G3 


3.73 


2.57 


1.05 


0.45 


0.36 


0.83 


G4(MP2) 


1.31 


1.03 


1.00 


0.77 


0.61 


1.01 


G4 


1.29 


0.95 


0.93 


0.74 


0.53 


0.95 


CBS-QB3 


2.78 


1.76 


0.79 


2.50 


1.53 


0.79 


CBS-APNO d 


1.83 


1.17 


0.71 


3.02 


2.23 


0.60 


Wlh 


0.94 


0.73 




1.71 


1.28 




W2.2h 




0.32 






0.91 




W3.21ite 




0.18 






0.76 





"Includes all the Ci-Cs alkanes considered in the present work, 
'includes all the Ci— C5 alkanes (w/o isopentane). 
Tor the linear^branched isomerization reactions. 

dr Not including isooctane, lor which the calculation was deemed too demanding in computer time. 
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TABLE V: Performance statistics (in kcal/mol) of various exchange-correlation functionals with 
and without the sq correction for the atomization energies of the C±—Cs alkanes considered in the 
present work. 











uncorrected 






corrected 




Functional 


Basis set 




RMSD 


MSD 


MAD 


RMSD 


MSD 


MAD 




RMSD pt 


SVWN5 


pc- 


2 


-0.25 


231 


218 


218 


227 


215 


215 


-16 


44 


PBE 


IX- 


2 


0.75 


13.70 


12.65 


12.71 


24.07 


22.08 


22.10 


-0.93 


3.75 


HCTH407 


IX- 


2 


1.10 


23.95 


-21.90 


21.90 


8.49 


-8.07 


8.07 


1.69 


1.50 


BLYP 


no 


2 


1.20 


36.70 


-33.86 


33.86 


19.93 


-18.77 


18.77 


2.58 


4.03 


TPSS 


pc- 


2 


1.00 


4.22 


3.56 


3.91 


16.81 


16.13 


16.13 


-0.16 


3.53 


M06-L 




2 


0.20 


12.27 


-11.58 


11.58 


9.50 


-9.06 


9.06 


0.86 


2.05 


PBEO 


pc- 


2 


0.70 


1.70 


0.55 


1.29 


10.57 


9.35 


9.57 


-0.04 


1.61 


B3PW91 


pc- 


2 


1.10 


7.02 


-6.00 


6.00 


8.63 


7.83 


7.85 


0.49 


0.90 


X3LYP 


pc- 


2 


0.85 


13.53 


-11.91 


11.95 


1.57 


-1.22 


1.39 


0.96 


0.43 


B97-3 


pc- 


2 


0.90 


15.63 


-14.29 


14.29 


3.03 


-2.97 


2.97 


1.10 


0.91 


B3LYP 


pc- 


2 


1.05 


16.60 


-14.69 


14.74 


1.86 


-1.49 


1.67 


1.17 


0.64 


B97-2 


pc- 


2 


1.05 


17.03 


-15.77 


15.77 


2.59 


-2.57 


2.57 


1.20 


1.48 


B97-1 


IX- 


2 


0.65 


20.41 


-14.29 


14.29 


11.38 


-2.97 


2.97 


1.43 


2.85 


TPSSh 


pc- 


2 


0.90 


3.69 


2.82 


3.38 


14.66 


14.14 


14.14 


-0.11 


3.34 


TPSS1KCIS 


pc- 


2 


0.90 


6.33 


-5.02 


5.18 


6.64 


6.29 


6.29 


0.44 


1.28 


PW6B95 


pc- 


2 


0.50 


7.03 


-6.27 


6.27 


0.21 


0.02 


0.15 


0.50 


0.21 


BMK 


pc- 


2 


0.65 


7.39 


-6.82 


6.82 


1.94 


1.36 


1.54 


0.52 


0.65 


M06-2X 


pc- 


2 


0.06 


7.42 


-7.13 


7.13 


6.59 


-6.37 


6.37 


0.51 


1.63 


M06 


pc- 


2 


0.25 


7.56 


-7.12 


7.12 


4.16 


-3.97 


3.97 


0.52 


1.51 


B1B95 


pc- 


2 


0.75 


12.87 


-11.89 


11.89 


2.52 


-2.46 


2.46 


0.91 


1.18 


B2K-PLYP 


pc- 


3« 


0.30 


4.16 


-3.78 


3.78 


0.22 


-0.01 


0.16 


0.29 


0.20 


B2GP-PLYP 


pc- 


oa 
O 


0.40 


5.51 


-4.95 


4.95 


0.20 


0.08 


0.14 


0.39 


0.13 


B2T-PLYP 


pc- 


3 a 


0.48 


7.99 


-7.22 


7.22 


1.25 


-1.18 


1.18 


0.56 


0.37 


B2-PLYP 


pc- 


3 a 


0.55 


8.71 


-7.81 


7.81 


0.98 


-0.90 


0.90 


0.62 


0.31 


mPW2-PLYP 


pc- 


3 a 


0.40 


12.48 


-11.39 


11.39 


6.86 


-6.36 


6.36 


0.88 


0.96 


B2K-PLYP 


F^ 


2 b 


0.30 


5.79 


-5.36 


5.36 


1.62 


-1.59 


1.59 


0.41 


0.57 


B2GP-PLYP 


pc- 


2 6 


0.40 


7.13 


-6.52 


6.52 


1.54 


-1.49 


1.49 


0.50 


0.52 


B2T-PLYP 


pc- 


2 6 


0.48 


9.59 


-8.76 


8.76 


2.87 


-2.73 


2.73 


0.68 


0.75 


B2-PLYP 


pc- 


2 b 


0.55 


10.35 


-9.40 


9.40 


2.64 


-2.49 


2.49 


0.73 


0.69 


mPW2-PLYP 


pc- 


2 b 


0.40 


14.04 


-12.88 


12.88 


8.43 


-7.86 


7.86 


0.99 


1.33 


MP2 


pc- 


3 


-0.15 


4.08 


-3.99 


3.99 


5.89 


-5.88 


5.88 


0.23 


2.56 


MP2-SCS 


pc- 


3 


0.17 


2.78 


-2.63 


2.63 


0.56 


-0.49 


0.53 


0.19 


0.44 


MP2 


pc- 


2 


-0.15 


18.42 


-17.97 


17.97 


20.46 


-19.85 


19.85 


1.25 


5.43 


MP2-SCS 


pc- 


2 


0.17 


19.90 


-18.88 


18.88 


17.55 


-16.74 


16.74 


1.38 


3.98 



"(frozen core) pc-3 basis set combined with a CBS extrapolation where Nmin=15 as recommended in Ref. 
b (frozen core) pc-2 basis set combined with a CBS extrapolation where Nmin=10 as recommended in Ref. 
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TABLE VI: Performance statistics (in kcal/mol) of various exchange-correlation functionals with 
and without the sq correction for the reaction energies of the isomerization linear-alkane— >branched- 
alkane for the C4-C8 alkanes considered in the present work. 



Functional 



SVWN5 



PBE 

HCTH407 
BLYP 



TPSS 
M06-L 



PBEO 

B3PW91 

X3LYP 

B97-3 

B3LYP 

B97-2 

B97-1 



TPSSh 

TPSS1KCIS 

PW6B95 

BMK 

M06-2X 

M06 

B1B95 



B2K-PLYP 

B2GP-PLYP 

B2T-PLYP 

B2-PLYP 

mPW2-PLYP 

B2K-PLYP 

B2GP-PIYP 

B2T-PLYP 

B2-PLYP 

mPW2-PIYP 



MP2 

MP2-SCS 
MP2 

MP2-SCS 



Basis set 



pc-2 



pc-2 
pc-2 
pc-2 



pc-2 
pc-2 



pc-2 
pc-2 
pc-2 
pc-2 
pc-2 
pc-2 
pc-2 



pc-2 
pc-2 
pc-2 
pc-2 
pc-2 
pc-2 
pc-2 



pc-3 a 
pc-3 Q 
pc-3 a 
pc-3 a 
pc-3 a 
pc-2 b 
pc-2 6 
pc-2 6 
pc-2 6 
pc-2 b 



pc-3 
pc-3 
pc-2 
pc-2 



■0.25 



0.75 
1.10 
1.20 



1.00 
0.20 



0.70 
1.10 
0.85 
0.90 
1.05 
1.05 
0.65 



0.90 
0.90 
0.50 
0.65 
0.06 
0.25 
0.75 



0.30 
0.40 
0.48 
0.55 
0.40 
0.30 
0.40 
0.48 
0.55 
0.40 



-0.15 
0.17 
-0.15 
0.17 



uncorrected 
RMSD MSD MAD 



0.67 -0.56 0.56 



2.90 
5.39 
4.52 



2.34 2.34 
4.26 4.26 
3.65 3.65 



3.50 2.90 2.90 
1.07 0.94 0.94 



2.79 
3.82 
3.65 
3.89 
3.99 
4.18 
3.09 



2.26 2.26 

3.11 3.11 

2.94 2.94 

3.15 3.15 

3.22 3.22 

3.38 3.38 

2.51 2.51 



3.42 
3.74 
2.09 
1.52 
0.21 
0.26 



2.83 2.83 

3.06 3.06 

1.76 1.76 

1.27 1.27 

0.19 0.19 

-0.15 0.18 



2.47 2.09 2.09 



1.20 
1.52 
1.80 
2.06 
2.15 
1.03 
1.37 
1.67 
1.94 
2.04 



0.96 0.96 

1.22 1.22 

1.45 1.45 

1.66 1.66 

1.73 1.73 

0.84 0.84 

1.11 1.11 

1.35 1.35 

1.57 1.57 

1.65 1.65 



1.12 
0.28 
1.24 
0.13 



-0.90 0.90 

0.23 0.23 

-0.97 0.97 

0.11 0.11 



corrected 
RMSD MSD MAD 



0.44 0.30 0.30 



0.36 -0.25 0.28 
0.68 0.46 0.46 
0.68 -0.50 0.51 



0.87 
0.48 



-0.55 0.60 
0.25 0.40 



0.26 
0.95 
0.13 
0.17 
0.56 
0.38 
0.33 



-0.15 0.19 

-0.68 0.69 

0.00 0.10 

0.04 0.12 

-0.41 0.42 

-0.24 0.29 

0.26 0.28 



0.55 
0.27 
0.28 
1.31 
0.15 
1.31 
0.83 



-0.28 0.36 

-0.04 0.19 

0.03 0.20 

-0.98 0.98 

-0.02 0.10 

-1.01 1.01 

-0.50 0.54 



-0.08 0.08 

-0.16 0.16 

-0.21 0.21 

-0.24 0.24 



0.11 
0.21 
0.27 
0.32 

0.43 0.35 0.35 

-0.20 0.20 

-0.27 0.27 

-0.31 0.31 

-0.32 0.32 

0.27 0.27 



0.27 
0.35 
0.40 
0.43 
0.34 



0.48 
0.47 
0.60 
0.67 



-0.38 0.38 

-0.35 0.35 

-0.45 0.45 

-0.48 0.48 



"(frozen core) pc-3 basis set combined with a CBS extrapolation where Nmin=15 as recommended in Ref. [90 
''(frozen core) pc-2 basis set combined with a CBS extrapolation where Nmin=10 as recommended in Ref. [9y 
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TABLE VII: Performance statistics (RMSD, kcal/mol) of various exchange-correlation functionals 
with and without the sq correction for the isodesmic reaction [6] for the C3-C8 alkanes considered 



in the present work. 











uncorrected 






corrected 










n-alkane 


iso-alkane 


neo-alkane 


all 


n-alkane 


iso-alkane 


neo-alkane 


all 


Functional 


Basis set 


^6 


(6 species) (4 species) (3 species) (13 species) 


(6 species) (4 species) (3 species) (13 species) 


VWN5 


pc-2 


-0.25 


0.38 


0.80 


0.90 


0.68 


0.61 


0.98 


1.03 


0.86 


PBE 


pc-2 


0.75 


3.35 


5.53 


6.12 


4.88 


0.46 


0.31 


0.39 


0.40 


HCTH407 


pc-2 


1.10 


4.42 


8.18 


9.16 


7.09 


0.19 


0.49 


0.72 


0.46 


BLYP 


pc-2 


1.20 


4.99 


8.34 


9.24 


7.33 


0.37 


0.22 


0.17 


0.29 


TPSS 


pc-2 


1.00 


4.94 


7.74 


8.47 


6.87 


1.09 


0.84 


0.84 


0.96 


M06-L 


pc-2 


0.20 


3.29 


4.45 


4.75 


4.07 


2.52 


3.07 


3.22 


2.88 


PBEO 


pc-2 


0.70 


3.39 


5.55 


6.03 


4.87 


0.69 


0.67 


0.67 


0.68 


B3PW91 


pc-2 


1.10 


4.53 


7.48 


8.16 


6.56 


0.29 


0.32 


0.33 


0.31 


X3LYP 


pc-2 


0.85 


4.13 


6.87 


7.54 


6.03 


0.86 


0.93 


1.03 


0.93 


B97-3 


pc-2 


0.90 


4.65 


7.60 


8.32 


6.69 


1.19 


1.32 


1.43 


1.29 


B3LYP 


pc-2 


1.05 


4.52 


7.52 


8.26 


6.60 


0.48 


0.25 


0.25 


0.37 


B97-2 


pc-2 


1.05 


4.65 


7.81 


8.55 


6.83 


0.61 


0.50 


0.52 


0.55 


B97-1 


pc-2 


0.65 


3.93 


6.30 


6.90 


5.57 


1.43 


1.76 


1.92 


1.67 


TPSSh 


pc-2 


0.90 


4.79 


7.54 


8.21 


6.68 


1.33 


1.29 


1.33 


1.31 


TPSR1KCTS 


pc-2 


0.90 


4.76 


7.67 


8.42 


6.78 


1.30 


1.40 


1.53 


1.39 


PW6B95 


pc-2 


0.50 


3.43 


5.18 


5.66 


4.64 


1.50 


1.70 


1.84 


1.65 


BMK 


pc-2 


0.65 


3.34 


4.69 


4.92 


4.22 


0.84 


0.44 


0.29 


0.63 


M06-2X 


pc-2 


0.06 


1.35 


1.71 


1.62 


1.55 


1 12 


1.29 


1.16 


1.19 


M06 


pc-2 


0.25 


1.74 


1.88 


1.52 


1.75 


0.78 


0.41 


0.44 


0.60 


B1B95 


pc-2 


0.75 


3.86 


5.93 


6.46 


5.28 


0.97 


0.76 


0.74 


0.85 


B2K-PLYP 


pc-3 a 


0.30 


1.76 


2.73 


2.82 


2.39 


0.61 


0.64 


0.52 


0.60 


B2GP-PLYP 


pc-3 a 


0.40 


2.08 


3.28 


3.44 


2.87 


0.54 


0.49 


0.38 


0.49 


B2T-PLYP 


pc-3 a 


0.48 


2.36 


3.76 


3.99 


3.30 


0.51 


0.42 


0.32 


0.44 


B2-PLYP 


pc-3 a 


0.55 


2.60 


4.19 


4.49 


3.68 


0.49 


0.37 


0.28 


0.41 


mPW2-PLYP 


pc-3 a 


0.40 


2.71 


4.37 


4.68 


3.83 


1.17 


1.58 


1.62 


1.43 


B2K-PLYP 


pc-2 b 


0.30 


1.79 


2.65 


2.74 


2.34 


0.64 


0.56 


0.44 


0.57 


B2GP-PLYP 


pc-2 b 


0.40 


2.11 


3.21 


3.38 


2.83 


0.57 


0.44 


0.32 


0.48 


B2T-PLYP 


pc-2 6 


0.48 


2.39 


3.70 


3.94 


3.27 


0.54 


0.38 


0.27 


0.44 


B2-PLYP 


pc-2 6 


0.55 


2.64 


4.15 


4.46 


3.66 


0.53 


0.35 


0.25 


0.42 


mPW2-PLYP 


pc-2 6 


0.40 


2.73 


4.32 


4.64 


3.80 


1.19 


1.53 


1.57 


1.40 


MP2 


pc-3 


-0.15 


0.51 


1.24 


1.66 


1.12 


0.08 


0.21 


0.52 


0.28 


MP2-SCS 


pc-3 


0.17 


1.18 


1.52 


1.38 


1.35 


0.53 


0.37 


0.10 


0.41 


MP2 


pc-2 


-0.15 


0.24 


1.09 


1.37 


0.92 


0.34 


0.21 


0.23 


0.28 


MP2-SCS 


pc-2 


0.17 


1.36 


1.57 


1.50 


1.47 


0.71 


0.48 


0.19 


0.55 



"(frozen core) pc-3 basis set combined with a CBS extrapolation where Nmin=15 as recommended in Ref. [9j 
''(frozen core) pc-2 basis set combined with a CBS extrapolation where Nmin=10 as recommended in Ref. [9( 
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